Information processing apparatus and information processing method, and information processing system

ABSTRACT

Provided are an information processing apparatus, an information processing method, and an information processing system, each of which provides a service related to a broadcast-type moving picture content. 
     An information processing apparatus includes: a receiving unit that receives, from a first device, a notice of data including a location or acquisition method of information suggesting an action of a virtual character, the action corresponding to an event that occurs in a broadcast-type moving picture content; an acquisition unit that acquires the information on the basis of the data issued as a notice from the first device; a display unit capable of two-dimensional or three-dimensional display; and a control unit that controls driving of the virtual character, which is to be displayed using the display unit, on the basis of the information acquired by the acquisition unit.

TECHNICAL FIELD

The technology disclosed in the present description relates to an information processing apparatus and an information processing method, and an information processing system, each of which provides a service related to a broadcast-type moving picture content.

BACKGROUND ART

It has been a long time since television broadcasting services became widespread. Moreover, recently, broadcast-type moving picture distribution services using a network, such as Internet Protocol TV (IPTV) and over-the-top (OTT) services, have also been becoming widespread.

On the other hand, use of systems, each of which interprets a verbal or textual natural language input to infer a user's intention and executes an action such as an interaction with the user, has also been becoming widespread. Such systems equipped with this type of interaction function, also called “agents” or “assistants”, are achieved as applications to be executed on dedicated electronic devices or information terminals such as televisions and smartphones. Moreover, the interaction function of this type of system is achieved using the artificial intelligence (AI) function, the back-end interaction engine connected via the Internet or the like, and the like.

For example, a proposal has been made for a virtual assistant that connects to a television or a set top box to control reproduction of media and other functions (see Patent Document 1).

CITATION LIST

Patent Document

-   Patent Document 1: Japanese Translation of PCT International Application Publication No. 2017-530567

SUMMARY OF THE INVENTION

Problems to be Solved by the Invention

An object of the technology disclosed in the present description is to provide an information processing apparatus and an information processing method, and an information processing system, each of which provides a service related to a broadcast-type moving picture content.

Solutions to Problems

A first aspect of the technology disclosed in the present description is an information processing apparatus including:

a receiving unit that receives a broadcast-type moving picture content; and

a notification unit that notifies a second device of data including a location or acquisition method of information suggesting an action of a virtual character, the action corresponding to an event that occurs in the moving picture content.

The information processing apparatus according to the first aspect is, for example, a television receiver that receives the moving picture content to be broadcast or streamed, and receives a trigger of the event by StreamEvent, WebSocket, or the like.

Moreover, a second aspect of the technology disclosed in the present description is an information processing method including:

a receiving step of receiving a broadcast-type moving picture content;

a displaying step of displaying the moving picture content on a display unit; and

a notifying step of notifying, in response to an event that occurs in the moving picture content, a second device of data including a location or acquisition method of information suggesting an action of a virtual character.

Furthermore, a third aspect of the technology disclosed in the present description is an information processing apparatus including:

a receiving unit that receives, from a first device, a notice of data including a location or acquisition method of information suggesting an action of a virtual character, the action corresponding to an event that occurs in a broadcast-type moving picture content; and

a control unit that acquires the information on the basis of the data issued as the notice from the first device, and controls driving of the virtual character on the basis of the acquired information.

The information processing apparatus according to the third aspect receives the notice from the first device, which is a television receiver that receives the broadcast-type moving picture content, and controls the driving of the virtual character to be displayed on a display unit capable of two-dimensional or three-dimensional display.

Moreover, a fourth aspect of the technology disclosed in the present description is an information processing method including:

a receiving step of receiving, from a first device, a notice of data including a location or acquisition method of information suggesting an action of a virtual character, the action corresponding to an event that occurs in a broadcast-type moving picture content;

an acquiring step of acquiring the information on the basis of the data issued as the notice from the first device; and

a controlling step of controlling driving of the virtual character on the basis of the information acquired in the acquiring step.

Furthermore, a fifth aspect of the technology disclosed in the present description is an information processing system including:

a first device that receives a broadcast-type moving picture content; and

a second device that controls driving of a virtual character, in which

in response to receiving a trigger of an event that occurs in the moving picture content, the first device notifies the second device of data including a location or acquisition method of information suggesting an action of the virtual character, the action corresponding to the event, and

the second device acquires the information on the basis of the data issued as a notice from the first device, and controls driving of the virtual character.

However, the “system” mentioned herein refers to a logical assembly of a plurality of apparatuses (or functional modules which achieve a specific function), and it does not matter whether or not the respective apparatuses or functional modules are in a single housing.

EFFECTS OF THE INVENTION

In accordance with the technology disclosed in the present description, there can be provided the information processing apparatus and the information processing method, and the information processing system, each of which automatically drives the virtual character in response to the content of the broadcast-type moving picture content.

Note that the effects described in the present description are merely examples, and the effects of the present invention are not limited thereto. Moreover, the present invention may exert additional effects in addition to the above effects.

Still other objects, features, and advantages of the technology disclosed in the present description will be apparent from a more detailed description based on the embodiments to be described later and the accompanying drawings.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a diagram schematically illustrating a configuration example of an interaction system 100.

FIG. 2 is a diagram illustrating a configuration example of a display apparatus 110.

FIG. 3 is a diagram schematically illustrating a configuration example of an interaction apparatus 120.

FIG. 4 is a diagram illustrating a functional configuration example of an interaction engine front end 410 and an interaction engine back end 420.

FIG. 5 is a diagram illustrating an example of an operation sequence in the interaction system 100.

FIG. 6 is a diagram illustrating an example of scene description data.

FIG. 7 is a diagram illustrating another example of the scene description data.

FIG. 8 is a flowchart illustrating a basic operation example of the display apparatus 110.

FIG. 9 is a flowchart illustrating a procedure of a processing operation to be implemented by the interaction apparatus 120 in a receiving state.

FIG. 10 is a flowchart illustrating a processing operation of the interaction apparatus 120 in a scene-following conversation mode.

FIG. 11 is a diagram exemplifying a mixed reality space in which an image of a virtual character appears in a room by an MR device.

FIG. 12 is a diagram illustrating a state in which the virtual character is displayed on a sub-screen provided on a main screen that displays a main part of a broadcast program.

FIG. 13 is a diagram illustrating a state in which a pet-type robot is autonomously driven in response to a content of a broadcast-type data content that is being viewed thereby together with a user.

MODE FOR CARRYING OUT THE INVENTION

A detailed description will be given below of embodiments of the technology disclosed in the present description with reference to the drawings.

FIG. 1 schematically illustrates a configuration example of an interaction system 100 to which the technology disclosed in the present description is applied. The illustrated interaction system 100 includes a display apparatus 110 and an interaction apparatus 120.

It is basically assumed that the display apparatus 110 and the interaction apparatus 120 are used by the same user. For example, the display apparatus 110 and the interaction apparatus 120 are installed in a space such as a living room where the user stays. The display apparatus 110 displays a broadcast or streamed content, and the user views the content displayed by the display apparatus 110. Moreover, the interaction apparatus 120 interacts with the user, receives a command of speech, gesture, or the like from the user, and presents information to the user.

The display apparatus 110 is interconnected to an external network such as the Internet, for example, via a router. Moreover, the interaction apparatus 120 is interconnected to the external network such as the Internet via an access point installed indoors.

Furthermore, it is assumed that the display apparatus 110 and the interaction apparatus 120 are connected to each other via communication means (not shown). The communication means may be either wired or wireless. For example, the display apparatus 110 and the interaction apparatus 120 may be connected to each other using communication based on an existing communication standard such as Ethernet (registered trademark), Wi-Fi (registered trademark), or Bluetooth (registered trademark), or may be connected to each other by a proprietary method. Moreover, only one-way communication from the display apparatus 110 to the interaction apparatus 120 may be performed using infrared communication or other simple communication means.

The display apparatus 110 is equipped with a large screen that displays a broadcast-type moving picture content. The display apparatus 110 includes, for example, a television receiver that selects and receives a broadcast signal; however, it may be a display connected to a set top box. The broadcast signal may be either terrestrial or satellite. Moreover, a broadcasting service to be used by the display apparatus 110 is not limited to television broadcasting, and for example, a broadcast-type moving picture distribution service such as IPTV and OTT using a network can be included therein. In the latter case, the display apparatus 110 can include a display equipped with a network interface card. As a matter of course, a moving picture content provided by a push-based content distribution service other than these may be included.

FIG. 2 illustrates a configuration example of the display apparatus 110. The display apparatus 110 includes a main control unit 201, a bus 202, a storage unit 203, a communication interface (IF) unit 204, an extension interface (IF) unit 205, a tuner/demodulator unit 206, a demultiplexer (DEMUX) 207, a video decoder 208, a speech decoder 209, a superimposed character decoder 210, a subtitle decoder 211, a subtitle synthesis unit 212, a data decoder 213, a cache unit 214, an application (AP) control unit 215, a browser unit 216, a sound source unit 217, a video synthesis unit 218, a display unit 219, a speech synthesis unit 220, a speech output unit 221, and an operation input unit 222.

The main control unit 201 includes, for example, a central processing unit (CPU), a read only memory (ROM), and a random access memory (RAM), and controls the entire display apparatus 110 in accordance with a predetermined operating program. The ROM is a non-volatile memory in which a basic operating program such as an operating system (OS) and other operating programs are stored. Operation setting values required for the operation of the display apparatus 110 may be stored in the ROM. The RAM serves as a work area when the OS and the other operating programs are executed. The bus 202 is a data communication path for transmitting/receiving data between the main control unit 201 and each unit in the display apparatus 110. Note that, in the present embodiment, it is assumed that trigger distribution synchronized with an event in the moving picture content is performed from the broadcasting service (broadcasting station or stream distribution server) side (described later), and the main control unit 201 can receive a result of decoding the trigger distribution from the data decoder 213 (described later).

The storage unit 203 includes a non-volatile storage device such as a flash ROM, a solid state drive (SSD), or a hard disc drive (HDD). The storage unit 203 stores the operating program and operation setting values of the display apparatus 110, personal information of the user who uses the display apparatus 110, and the like. Moreover, the storage unit 203 stores an operating program downloaded via the Internet, various data created by the operating program, and the like. Furthermore, the storage unit 203 is also able to store contents such as moving pictures, still images, and sounds, which are acquired through broadcast waves and the Internet.

The communication interface unit 204 is connected to the Internet via a router (mentioned above) or the like, and transmits/receives data to/from each server apparatus or other communication devices on the Internet. Moreover, it is assumed that the communication interface unit 204 also acquires a data stream of a program transmitted via a communication line. The communication interface unit 204 may be connected to the router by either a wired connection such as Ethernet (registered trademark) or a wireless connection such as Wi-Fi (registered trademark). Furthermore, the communication interface unit 204 may include communication means with the interaction apparatus 120. The communication means with the interaction apparatus 120 may be only one-way communication to the interaction apparatus 120.

The tuner/demodulator unit 206 receives a broadcast wave such as a terrestrial broadcast or a satellite broadcast via an antenna (not shown), and tunes to (selects) a channel of a service (broadcasting station or the like) desired by the user on the basis of the control of the main control unit 201. Moreover, the tuner/demodulator unit 206 demodulates the received broadcast signal to acquire a broadcast data stream. Note that the display apparatus 110 may have a configuration in which a plurality of tuner/demodulator units is mounted (that is, multiple tuners) for the purpose of simultaneously displaying a plurality of screens, recording a competing program, or the like.

On the basis of a control signal in the received broadcast data stream, the demultiplexer 207 distributes a video data stream, a speech data stream, a superimposed character data stream, and a subtitle data stream, which are elements to be presented in real time, to the video decoder 208, the speech decoder 209, the superimposed character decoder 210, and the subtitle decoder 211, respectively. Data input to the demultiplexer 207 includes data from the broadcasting service and the distribution service such as IPTV and OTT. The former is input to the demultiplexer 207 after being selected and demodulated by the tuner/demodulator unit 206, and the latter is input to the demultiplexer 207 after being received by the communication interface unit 204.

Moreover, the demultiplexer 207 reproduces a multimedia application and file-system data that is a component thereof, and outputs the data to the application control unit 215 or temporarily stores the data in the cache unit 214. Moreover, the demultiplexer 207 extracts general-purpose data and outputs the general-purpose data to the data decoder 213 so that the general-purpose data can be used for data streaming or for an application in a player that presents data other than the above video, speech, and subtitles.

The video decoder 208 decodes the video data stream received from the demultiplexer 207, and outputs video information. Moreover, the speech decoder 209 decodes the speech data stream received from the demultiplexer 207, and outputs speech information. Furthermore, in order to simultaneously perform decoding processes for a plurality of types of video data streams and speech data streams, the display apparatus 110 may include pluralities of the video decoders 208 and the speech decoders 209.

The superimposed character decoder 210 decodes the superimposed character data stream received from the demultiplexer 207, and outputs superimposed character information. The subtitle decoder 211 decodes the subtitle data stream received from the demultiplexer 207, and outputs subtitle information. The subtitle synthesis unit 212 synthesizes the superimposed character information output from the superimposed character decoder 210 and the subtitle information output from the subtitle decoder 211 with each other.

The data decoder 213 decodes a data stream multiplexed, together with video and speech, into an MPEG-2 TS stream. In the present embodiment, the data decoder 213 notifies the main control unit 201 of a result of decoding a general-purpose event message stored in a descriptor area of a program map table (PMT), which is one of the program specific information (PSI) tables. Moreover, the data decoder 213 decodes data transmitted using WebSocket, and notifies the main control unit 201 of the decoded data. Specifically, trigger distribution synchronized with the event in the moving picture content is performed using StreamEvent, WebSocket, or the like, and the data decoder 213 notifies the main control unit 201 of a result of decoding the trigger distribution.

The application control unit 215 receives control information included in the broadcast data stream from the demultiplexer 207, or acquires control information from a server apparatus on the Internet via the communication interface unit 204, and interprets these pieces of control information.

In accordance with an instruction of the application control unit 215, the browser unit 216 presents a multimedia application file acquired from the server apparatus on the Internet via the cache unit 214 or the communication interface unit 204, and file-system data that is a component thereof. The multimedia application file referred to here is, for example, a hyper text markup language (HTML) document, a broadcast markup language (BML) document, or the like. Moreover, it is assumed that the browser unit 216 also reproduces speech information of the application by acting on the sound source unit 217.

The video synthesis unit 218 receives the video information output from the video decoder 208, the subtitle information output from the subtitle synthesis unit 212, and the application information output from the browser unit 216, and performs a process for appropriately selecting or superimposing them. The video synthesis unit 218 includes a video RAM (not shown), and display driving of the display unit 219 is implemented on the basis of video information input to the video RAM. Moreover, on the basis of the control of the main control unit 201, according to needs, the video synthesis unit 218 also performs a superimposition process for superimposing screen information such as an electronic program guide (EPG) screen and graphics generated by the application executed by the main control unit 201.

The display unit 219 is a display device including, for example, a liquid crystal display or an organic electro-luminescence (EL) display, and presents, to the user, video information selected or subjected to the superimposition process by the video synthesis unit 218.

The speech synthesis unit 220 receives the speech information output from the speech decoder 209 and the speech information of the application reproduced by the sound source unit 217, and performs a process such as appropriate selection and synthesis thereof.

The speech output unit 221 includes one or a plurality of speakers. The speech output unit 221 may be a speaker array (multi-channel speaker or ultra multi-channel speaker) in which a plurality of speakers is combined with one another, or a panel speaker. The speech output unit 221 presents, to the user, speech information processed by the speech synthesis unit 220.

The extension interface unit 205 is a group of interfaces for extending the functions of the display apparatus 110, and includes, for example, an analog video/speech interface, a universal serial bus (USB) interface, a memory interface, and the like. The extension interface unit 205 may include a digital interface including a digital visual interface (DVI) terminal, an HDMI (registered trademark) terminal, a Display Port (registered trademark) terminal, and the like.

The operation input unit 222 is an instruction input unit by which the user inputs operation instructions to the display apparatus 110. The operation input unit 222 includes, for example, a remote control reception unit that receives a command transmitted from a remote controller (not shown) and operation keys in which button switches are arranged. Moreover, the operation input unit 222 may include a touch panel superimposed on a screen of the display unit 219. Furthermore, the operation input unit 222 may include an external input device such as a keyboard connected to the extension interface unit 205.

Besides the television receiver, the display apparatus 110 may be a disc drive recorder such as a Blu-ray (registered trademark) disc recorder or an HDD recorder, a set top box (STB), a multifunctional information terminal such as a personal computer or a tablet, a navigation apparatus, a gaming machine, or the like, each of which is equipped with a digital broadcast reception function and a broadcast communication cooperation function.

A description will be given while referring to FIG. 1 one more time. The interaction apparatus 120 is a device that interprets a verbal or textual natural language input to infer a user's intention and achieves interaction with the user, and is equipped with a function also called a so-called “agent” or “assistant”.

It is basically assumed that the interaction apparatus 120 is equipped with a sub-screen for displaying a virtual character that expresses a behavior of the “agent” or “assistant” (FIG. 1 illustrates a state in which a video of a virtual character is displayed on the sub-screen of the interaction apparatus 120). The interaction apparatus 120 autonomously drives this virtual character in response to an interaction with the user. It is preferable that the sub-screen be a screen that has a 3D display function or is capable of realistic video expression, for example, like a light field display, and can display the virtual character three-dimensionally. Note that a light field refers to a visible space including all the light within the field of view of an observer, and a light field display is ideally a display apparatus that reproduces all the light that enters the observer's eyes. However, the sub-screen of the interaction apparatus 120 may have only a 2D display function.

Alternatively, the interaction apparatus 120 may be a mixed reality (MR) device such as a head-mounted display that displays a video of the virtual character composited with a real space. FIG. 11 exemplifies a mixed reality space in which the video of the virtual character appears in a room by the MR device.

Alternatively, on the television receiver used as the display apparatus 110, an application for autonomously driving the virtual character that interacts with the user may be started, a child screen may be provided in the screen of the display unit 219, and the virtual character may be displayed thereon. FIG. 12 illustrates a state in which the virtual character is displayed on such a sub-screen provided on a main screen that displays a main part of a broadcast program.

Alternatively, the interaction apparatus 120 may be a pet-type robot instead of the device equipped with the sub-screen that displays the virtual character three-dimensionally, and may express a behavior as an “agent” or “assistant” by joint movement of the robot. In this case, the interaction apparatus 120 responds to an interaction with the user, and the robot autonomously drives a movable portion such as a joint. FIG. 13 illustrates a state in which the pet-type robot is autonomously driven in response to a content of a broadcast-type data content that is being viewed thereby together with the user.

Alternatively, it is also assumed that an application of the “agent” or the “assistant” is started on an information terminal such as a smartphone or tablet owned by the user and is used as the interaction apparatus 120. In this case, the virtual character is displayed on a screen of the information terminal or on a display externally connected to the information terminal. Moreover, it is also assumed that the application is started on an information home appliance installed in a room where the display apparatus 110 is present, and is used as the interaction apparatus 120.

In short, the interaction apparatus 120 is basically a device equipped with a 3D display function; however, it is also replaceable with a device of another form. In the present embodiment, it is assumed that, in any form, the interaction apparatus 120 views the broadcast-type moving picture content, which is displayed on the display apparatus 110, together with the user.

FIG. 3 schematically illustrates a configuration example of the interaction apparatus 120. However, in FIG. 3, a dedicated device such as a speech agent is assumed as the interaction apparatus 120. The interaction apparatus 120 includes a processing unit 301, a communication unit 302, a display unit 303, a speech input unit 304, a speech output unit 305, a sensor unit 306, and a recording unit 307.

The communication unit 302 connects to an external network such as the Internet via an access point (see FIG. 1) using wired communication such as Ethernet (registered trademark) or wireless communication such as Wi-Fi (registered trademark). Moreover, the communication unit 302 may be interconnected to each home CE device via a home network, for example, in accordance with a standard such as Digital Living Network Alliance (DLNA) (registered trademark), or may further include an interface function with an Internet of Things (IoT) device.

It is assumed that, in the interaction system 100 according to the present embodiment, the display unit 303 functions as a “sub-screen” for the main screen of the display apparatus 110 to display the virtual character that expresses the behavior of the “agent” or “assistant”. More preferably, the display unit 303 includes a screen that has a 3D display function, for example, such as the light field display (mentioned above), or is capable of realistic video expression, and can display the virtual character three-dimensionally. As a matter of course, the display unit 303 may have only a 2D display function. However, the display unit 303 can also include an MR device or other display externally connected to the interaction apparatus 120.

The speech input unit 304 includes a sound collecting element such as a microphone, and is used for receiving speech generated in a room in which the interaction apparatus 120 is installed. Examples of the speech generated in the room include utterances by a viewer of a television program or by a user who uses the speech agent. The speech input unit 304 may include a microphone array in which a plurality of microphones is combined with one another.

The speech output unit 305 includes an audio generating element such as a speaker. A cone-type speaker, a flat panel-type speaker, or the like can be used for the speech output unit 305. Moreover, the speech output unit 305 may include a speaker array (multi-channel speaker or ultra multi-channel speaker) in which a plurality of speakers is combined with one another. The speech output unit 305 is used, for example, for outputting a synthetic speech of the speech agent function.

The sensor unit 306 senses, for example, environmental information in the room where the interaction apparatus 120 is installed. The configuration of the sensor unit 306, that is, what kind of sensor elements are included therein, is arbitrary. For example, the sensor unit 306 may include a camera, an object detection sensor, and a depth sensor. Moreover, the sensor unit 306 may include an environment sensor that detects environmental information, such as an illuminance sensor, a temperature sensor, and a humidity sensor. Furthermore, the sensor unit 306 may include an infrared sensor or a human sensor. Moreover, the sensor unit 306 may include a biosensor that detects the user's pulse, sweating, brain waves, myogenic potential, exhalation, and the like. Furthermore, the sensor unit 306 may include an inertial measurement unit (IMU) to detect a posture of a main body of the interaction apparatus 120 or the like.

The recording unit 307 includes, for example, a large-capacity recording device such as an HDD or an SSD, and is used for recording various data. The recording unit 307 is disposed in the interaction apparatus 120, and may further be externally connected to the interaction apparatus 120 via an interface such as a USB interface.

The processing unit 301 includes a processor and a memory, executes programs loaded in the memory, implements various processes, and comprehensively controls the operation of the interaction apparatus 120. In the processing unit 301, basically, various applications are executed under an execution environment provided by the OS. For example, in a case where a multiprocessor is usable, or in a case where multithread execution is possible by the OS, all units of processes executable in parallel can be read into the memory and executed in parallel.

Among the applications to be executed by the processing unit 301 is an interactive application for achieving an interaction function with the user, such as a speech agent. Moreover, parallel execution of a plurality of such interactive applications by the processing unit 301 also makes it possible to cause the interaction apparatus 120 to function as a plurality of agent devices. In the present embodiment, it is assumed that the interactive application presents the virtual character on the display unit 303.

The interaction apparatus 120 may be equipped with an AI function in order to achieve a function to fully interact with the user. Moreover, the interaction apparatus 120 may be configured not as a stand-alone apparatus as illustrated in FIG. 1 or the like, but as a front end of an interaction engine, and may be configured to operate under the control of an interaction engine back end including a server or a cloud on the Internet.

FIG. 4 illustrates a functional configuration example of the interaction engine front end 410 and the interaction engine back end 420.

The interaction engine front end 410 is an interactive application that is disposed on the local side (that is, near the user) and is executed by the interaction apparatus 120, and can take advantage of the respective function modules of the speech input unit 304, the speech output unit 305, and the like, which are equipped in the interaction apparatus 120.

Meanwhile, the cloud side includes the interaction engine back end 420 and a plurality of external agent services 430A, 430B, 430C, . . . The interaction engine back end 420 and the external agent services 430A, 430B, 430C, . . . all provide agent services which answer an inquiry from the user on the local side. However, while the former cooperates directly with the interaction engine front end 410, the latter does not cooperate directly with the interaction engine front end 410, but indirectly provides an agent service to the interaction engine front end 410 through the intervention of the interaction engine back end 420, and accordingly, is called an “external agent service” to be distinguished from the interaction engine back end 420. Moreover, at least a part of the external agent services 430A, 430B, 430C, . . . (the external agent service 430C in the example illustrated in FIG. 4) directly cooperates with an external agent device 440 installed on the local side.

The interaction engine back end 420 includes a speech recognition unit 421, a semantic analysis unit 422, a speech synthesis unit 423, and a control unit 424.

The speech recognition unit 421 performs speech recognition for a user's utterance collected by the speech input unit 304 on the interaction engine front end 410 side and converts the user's utterance into text information. The semantic analysis unit 422 performs semantic analysis for data of the user's utterance on the basis of the text information subjected to the speech recognition. Moreover, the speech synthesis unit 423 converts, into speech data, text information that serves as an answer or a feedback to the user. The speech data is sent to the interaction engine front end 410, and is output as a speech to the user from the speech output unit 305. Note that the interaction engine front end 410 may give the user the feedback by an action or behavior of the virtual character, which is displayed on the screen of the display unit 303, in addition to such a speech output.
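
The flow through the interaction engine back end 420 described above can be pictured, purely as an illustration, by the following sketch. The function names and their placeholder bodies are assumptions made here for readability; they do not correspond to an actual implementation of the units 421 to 424.

```python
# Minimal sketch of the pipeline of the interaction engine back end 420:
# speech -> text (421) -> semantic analysis (422) -> answer (424) -> speech (423).
# Every function body is a hypothetical placeholder.

def recognize_speech(speech_data: bytes) -> str:
    """Speech recognition unit 421: convert the collected utterance into text."""
    raise NotImplementedError  # e.g., call a speech recognition engine here

def analyze_semantics(text: str) -> dict:
    """Semantic analysis unit 422: infer the user's intention from the text."""
    raise NotImplementedError

def decide_answer(intent: dict) -> str:
    """Control unit 424: search, acquire a content, or place an order, then form an answer."""
    raise NotImplementedError

def synthesize_speech(text: str) -> bytes:
    """Speech synthesis unit 423: convert the answer text into speech data."""
    raise NotImplementedError

def handle_utterance(speech_data: bytes) -> bytes:
    """End-to-end handling of one utterance sent up from the front end 410."""
    text = recognize_speech(speech_data)
    intent = analyze_semantics(text)
    answer = decide_answer(intent)
    return synthesize_speech(answer)  # returned to the front end 410 for speech output
```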

The control unit 424 executes a process for interacting with the user of the interaction engine front end 410. For example, the control unit 424 searches for information inquired by the user, acquires a content requested by the user, and places an order for a product. In order to achieve a superior answer to the inquiry from the user, the control unit 424 may be equipped with a learning function or an AI function. The control unit 424 may execute a part (or all) of the process for generating the virtual character to be displayed on the screen of the display unit 303.

Moreover, the control unit 424 can further take advantage of the resources of the external agent services 430A, 430B, 430C, . . . in order to achieve a superior interaction with the user. The control unit 424 may call all the usable external agent services 430A, 430B, 430C, . . . , or may select and call some of the external agent services.

Moreover, several methods for calling the external agent services 430A, 430B, 430C, . . . from the interaction engine back end 420 are conceivable. For example, text information obtained by semantically analyzing the inquiry from the user by the semantic analysis unit 422 may be transmitted to the external agent service 430B, or speech data obtained by performing speech synthesis for the text information by the speech synthesis unit 423 may be transmitted to the external agent service 430A to make an inquiry in a form like a speech input from the user. Alternatively, in a case where the interaction engine back end 420 makes a request to the external agent service 430C, such a procedure can also be adopted in which the speech data obtained by the speech synthesis by the speech synthesis unit 423 is transmitted to the interaction engine front end 410, a speech like an inquiry of the user is output from the speech output unit 305 of the interaction apparatus 120, and an inquiry is made through the external agent device 440.

Furthermore, several methods for responding to the interaction system 100 from the external agent services 430A, 430B, 430C, . . . , which are external resources, are also conceivable. For example, the external agent service 430B sends a processing result including text information or the like (or before the speech synthesis) to the interaction engine back end 420. Moreover, the external agent services 430A and 430C send, to the interaction engine back end 420, speech data obtained by performing speech synthesis for the processing result. The interaction engine back end 420 performs, by the speech recognition unit 421, speech recognition for the speech data sent from the external agent services 430A and 430C, and semantically analyzes a result of the speech recognition by the semantic analysis unit 422.

Then, the control unit 424 aggregates the answers from the respective external agent services 430A, 430B, 430C, . . . at the level of a result of the semantic analysis, and generates an answer sentence for the user. A method for aggregating a plurality of the answers is arbitrary. An answer from a specific external agent service may be selected after being compared against an answer obtained by the interaction engine back end 420 itself, an answer from a specific external agent service may be selected on the basis of a predetermined selection criterion such as a majority vote, or answers from a plurality of the external agent services may be synthesized with one another to generate one answer sentence. In any case, the speech synthesis unit 423 converts the answer sentence generated by the control unit 424 into speech data, and sends the speech data to the interaction engine front end 410. Then, in the interaction engine front end 410, the answer is output as a speech to the user from the speech output unit 305.
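
As one concrete reading of the aggregation described above, the following sketch selects an answer by a simple majority vote over the semantic-analysis results returned by the agent services. The dictionary fields are assumptions introduced only for this illustration.

```python
from collections import Counter

def aggregate_answers(candidates: list[dict]) -> str:
    """Pick one answer sentence from the semantic-analysis results of several agent services.

    Each candidate is assumed to look like {"intent": "...", "answer": "..."};
    the answer whose intent is supported by the most services wins (majority vote).
    """
    if not candidates:
        return "I could not find an answer."
    votes = Counter(c["intent"] for c in candidates)
    best_intent, _ = votes.most_common(1)[0]
    # Return the first answer sentence that matches the winning intent.
    return next(c["answer"] for c in candidates if c["intent"] == best_intent)

# Example: three services answer; two agree on the same intent, so one of their answers is used.
print(aggregate_answers([
    {"intent": "weather_today", "answer": "It will be sunny today."},
    {"intent": "weather_today", "answer": "Sunny, with a high of 25 degrees."},
    {"intent": "weather_week", "answer": "Rain is expected later this week."},
]))
```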

Functional configurations of the respective external agent services 430A, 430B, 430C, . . . are substantially similar to one another. The functional configuration of the external agent service 430C will be described below as a representative.

The external agent service 430C includes a speech recognition unit 431C, a semantic analysis unit 432C, a processing unit 433C, and a speech synthesis unit 434C.

The speech recognition unit 431C performs speech recognition for an utterance collected by a speech input unit 441 of the external agent device 440, and converts the utterance into text information. Note that it is also assumed that the speech collected by the speech input unit 441 may be not only the user's utterance but also a speech output from the speech output unit 305 of the interaction engine front end 410.

The semantic analysis unit 432C performs semantic analysis for data of the user's utterance on the basis of the text information subjected to the speech recognition. The processing unit 433C executes a process for answering the inquiry from the user. For example, when it is found from the result of the semantic analysis that a “start word” that specifies the external agent service 430C is input as a speech from the user, the processing unit 433C starts execution of the process.

The speech synthesis unit 434C converts, into speech data, text information that serves as the answer to the user. The speech data is sent to the external agent device 440, and is output as a speech to the user from a speech output unit 442. Note that such a case is also assumed where the speech data generated by the speech synthesis unit 434C is not only transmitted to the external agent device 440 and output as a speech, but also transmitted to the interaction engine back end 420 and input to the speech recognition unit 421.

B. Specific Action of Virtual Character

In the interaction system 100 according to the present embodiment, the display apparatus 110 that displays the broadcast-type moving picture content is positioned as the main screen, and meanwhile, the display unit 303 equipped in the interaction apparatus 120 is positioned as the sub-screen. Then, the interaction apparatus 120 autonomously drives the virtual character on the display unit 303 serving as the sub-screen. More specifically, while the interaction apparatus 120 has a function to interact with the user, when the interaction apparatus 120 views a broadcast-type moving picture content displayed on the main screen of the display apparatus 110 together with the user, the virtual character on the sub-screen is autonomously driven to switch its facial expression, gesture, behavior, and action in response to a content of the broadcast-type moving picture content.

The interaction system 100 according to the present embodiment has a main feature in that the interaction apparatus 120 autonomously drives the virtual character on the sub-screen in response to the content of the broadcast-type moving picture content that is being viewed together with the user.

Moreover, the interaction system 100 according to the present embodiment has another main feature in that information for suggesting autonomous driving of the virtual character is provided to the interaction apparatus 120 from the outside. In the present description, such information suggesting the autonomous driving of the virtual character is referred to as “scene description data”. For example, a service side such as a broadcasting station or a stream distribution server may create or edit scene description data for a moving picture content to be broadcast or distributed, and may upload the scene description data to a predetermined website in advance.

For example, every time a broadcast channel is switched on the display apparatus 110 side, the interaction apparatus 120 switches the virtual character, which is to be displayed, on the basis of newly provided scene description data. Specifically, for each broadcasting station, the interaction apparatus 120 may switch the virtual character prepared on the service side.

Moreover, the interaction apparatus 120 may switch to a virtual character dedicated to a program on the basis of the newly provided scene description data every time not the broadcast channel but a broadcast program is switched.

Furthermore, the interaction apparatus 120 may switch a virtual character, which is prepared on a distribution service side, on the basis of the newly provided scene description data every time a distribution source of the content is switched or a streaming content to be received is switched. Moreover, the interaction apparatus 120 may switch the virtual character, which is prepared on the distribution service side, not for each distribution source but for each distribution content.

Moreover, even in the same broadcast program or the same streaming content, the interaction apparatus 120 may switch the virtual character or control an action expressed by the virtual character on the basis of scene description data newly provided in response to a scene or context in the moving picture content.

For example, when a climax scene arrives while the user is viewing a drama program, the interaction apparatus 120 may express such an action of the virtual character that speaks for the user's emotion or synchronizes with the user on the basis of the newly provided scene description data.

Moreover, when a subject team (a team supported by the user) scores a point or conversely concedes a point while the user is viewing a sports program of baseball, soccer, or the like, the interaction apparatus 120 may express such an action of the virtual character that speaks for the user's emotion or synchronizes with the user on the basis of the newly provided scene description data. For example, the interaction apparatus 120 rejoices together or nods when the subject team is dominant, or makes an interjection or changes the emotion or the facial expression following a progress of the moving picture content (a progress of a game, or the like).

Moreover, while the user is viewing a news program, the interaction apparatus 120 may express such an action of the virtual character that introduces a genre of interest to the user among domestic, international, social, economic, performing arts, sports, science, region, and the like on the basis of scene description data to be provided for each section.

Moreover, the interaction apparatus 120 may express an action by which the virtual character introduces a product corresponding to a CM inserted into the main part of the program, on the basis of scene description data newly provided at the timing of the CM rather than the scene description data of the main part of the program to be broadcast or streamed. At that time, the interaction apparatus 120 may perform matching processing between the user's profile and the product, and change the way the virtual character recommends the product in response to a degree of interest of the user in the product, and the like.
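
As an illustration of such matching processing, a minimal sketch follows; the representation of the user's profile and the product as tag sets, and the thresholds that change the way of recommending, are assumptions made here.

```python
def interest_score(user_interests: set[str], product_tags: set[str]) -> float:
    """Very simple matching: fraction of the product's tags that appear among the user's interests."""
    if not product_tags:
        return 0.0
    return len(user_interests & product_tags) / len(product_tags)

def recommendation_style(score: float) -> str:
    """Change how strongly the virtual character recommends the CM product."""
    if score >= 0.5:
        return "enthusiastic"   # actively recommend the product
    if score > 0.0:
        return "casual"         # mention the product briefly
    return "silent"             # let the CM play without comment

profile = {"soccer", "running", "camping"}
cm_product = {"running", "shoes"}
print(recommendation_style(interest_score(profile, cm_product)))  # -> "enthusiastic"
```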

Moreover, it is assumed that, no matter which type of moving picture content is being viewed, the virtual character basically responds to questions and chats from the user.

As described above, in the interaction system 100 according to the present embodiment, the interaction apparatus 120 switches the virtual character for each broadcast-type moving picture content or changes the emotional expression of the virtual character for each scene of the moving picture content on the basis of the scene description data to be sequentially provided. Hence, the user views the broadcast-type moving picture content together with the interaction apparatus 120, and can thereby enjoy the content more than in a case of viewing the content alone.

The interaction apparatus 120 may control the action of the virtual character in a standalone manner; however, as illustrated in FIG. 4, it may also control the action of the virtual character by a cooperative operation of the interaction engine front end 410 and the interaction engine back end 420.

Moreover, it is arbitrary whether the interaction apparatus 120 operates the virtual character completely according to the scene description data provided from the outside, operates the virtual character with reference to the description content of the scene description data to some extent, or operates the virtual character completely autonomously without being restricted by the description content of the scene description data. For example, the user may specify whether the interaction apparatus 120 should follow the scene description data.

FIG. 5 illustrates an example of an operation sequence for the interaction apparatus 120 to express an emotion while viewing the broadcast-type moving picture content in the interaction system 100 according to the present embodiment.

During a period of transmitting the broadcast-type moving picture content, the service side such as a broadcasting station or a stream distribution server sequentially executes a trigger distribution to notify that an event that can trigger an emotional expression or the like of the virtual character has occurred in the moving picture content.

In the operation sequence example illustrated in FIG. 5, during a period of distributing the broadcast-type moving picture content (SEQ 501), the service side such as a broadcasting station or a stream distribution server sequentially implements the trigger distribution for the display apparatus 110, which receives (tunes and receives) the moving picture content, every time an event occurs in the moving picture content (SEQ 511, SEQ 512, . . . ).

That is, the service side such as a broadcasting station or a stream distribution server implements the trigger distribution in synchronization with the occurrence of an event, which can serve as a trigger, in the broadcast-type moving picture content. In the case of a streaming service, a delay time such as a network delay may occur from the occurrence of an event in the moving picture content to the arrival of the trigger distribution at the display apparatus 110; however, it is assumed that the delay time is negligible.

In the present embodiment, it is assumed that the service side implements the trigger distribution using means already defined by the standard of the broadcasting service and the standard of the stream distribution service.

For example, in the case of the broadcasting service, the broadcasting station side can implement the trigger distribution for the display apparatus 110, which tunes in to and receives a broadcast program, using StreamEvent or the like. The Association of Radio Industries and Businesses (ARIB) defines, as a transmission format of a broadcast signal, a Moving Picture Experts Group (MPEG)-2 transport stream (TS) packet in which a PSI table is multiplexed together with contents such as video and speech streams and data. A general-purpose event message is stored in a descriptor area of the PMT, which is one of such PSI tables, whereby the trigger distribution using StreamEvent can be implemented.

Further, in the case of a content streaming service, the service side such as a stream distribution server can implement the trigger distribution to the display apparatus 110, which receives a content stream, using WebSocket or the like. WebSocket is a protocol that operates on the transmission control protocol (TCP) and specifies that, once a server and a client establish a connection, all necessary communication is performed over that connection using a dedicated protocol. The header overhead of WebSocket between a Web server and a client is small, and WebSocket can achieve highly efficient transmission. WebSocket is defined in Request for Comments (RFC) 6455, “The WebSocket Protocol”.
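
As one possible illustration of the WebSocket case, the sketch below shows the display apparatus 110 side receiving trigger messages over a single connection, using the third-party Python package `websockets`. The endpoint URL and the JSON payload shape are assumptions; an actual service would define its own message format.

```python
import asyncio
import json

import websockets  # third-party package: pip install websockets

TRIGGER_ENDPOINT = "wss://example.com/trigger"  # hypothetical distribution endpoint

async def receive_triggers() -> None:
    # Once the connection is established, all trigger messages arrive over it.
    async with websockets.connect(TRIGGER_ENDPOINT) as ws:
        async for message in ws:
            trigger = json.loads(message)  # assumed to be a small JSON payload
            print("trigger received:", trigger)
            # ...forward the trigger data to the interaction apparatus 120 here...

if __name__ == "__main__":
    asyncio.run(receive_triggers())
```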

Regardless of which broadcasting standard or communication standard is used as a basis, the service side such as a broadcasting station or a stream distribution server transmits, in the trigger distribution, trigger data including information regarding the location or acquisition method of the scene description data (described above) to the display apparatus 110 (SEQ 511, SEQ 512, . . . ). The location or acquisition method of the scene description data can be described in the format of, for example, a uniform resource identifier (URI) or a uniform resource locator (URL).
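
A trigger-data payload that carries only a pointer to the scene description data might therefore look like the following; every field name here is hypothetical.

```python
# Hypothetical example of trigger data: small, and containing only the location
# of the scene description data rather than the scene description data itself.
trigger_data = {
    "event_id": "prog1234-scene-0042",                                # hypothetical identifier
    "scene_description_url": "https://example.com/scenes/0042.json",  # URI/URL of the data
    "valid_until": "2019-07-01T12:34:56+09:00",                       # hypothetical expiry
}
```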

Upon receiving the trigger data, for example, in the format of StreamEvent, WebSocket, or the like, the display apparatus 110 converts, as necessary, the trigger data into a data format processable by the interaction apparatus 120, and transfers the trigger data to the interaction apparatus 120 (SEQ 521, SEQ 522, . . . ).

The trigger data does not include the scene description data itself, but includes simple, small-capacity data that specifies the location or acquisition method of the scene description data in a format such as a URI or URL. Hence, it is possible to transfer the trigger data using the simple communication means (described above) of only one-way communication between the display apparatus 110 and the interaction apparatus 120.

Upon receiving the trigger data from the display apparatus 110 (SEQ 521, SEQ 522, . . . ), the interaction apparatus 120 acquires scene description data from a predetermined website in accordance with the location or acquisition method specified in the trigger data in a format such as a URI or URL (SEQ 531, SEQ 532, . . . ).
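
Acquiring the scene description data then amounts to fetching the resource named in the trigger data, for example as sketched below (the field name follows the hypothetical payload shown earlier, and the data is assumed to be served as JSON).

```python
import json
import urllib.request

def fetch_scene_description(trigger_data: dict) -> dict:
    """Acquire the scene description data from the location specified in the trigger data."""
    url = trigger_data["scene_description_url"]  # hypothetical field name, URI/URL format
    with urllib.request.urlopen(url, timeout=5) as resp:
        return json.loads(resp.read().decode("utf-8"))
```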

Then, the interaction apparatus 120 autonomously drives the virtual character, which is to be displayed on the sub-screen (the display unit 303), on the basis of the acquired scene description data (SEQ 541, SEQ 542, . . . ).

Note that, at the time of transferring the trigger data to the interaction apparatus 120 (SEQ 521, SEQ 522, . . . ), the display apparatus 110 may also transmit, to the interaction apparatus 120, data of a virtual character (hereinafter, also referred to as a “distribution character”) to be displayed on the sub-screen. The data of the virtual character may be data distributed as a multimedia content by the broadcasting station or the stream distribution server along with the moving picture content, or may be data installed in the display apparatus 110 in advance. On the other hand, the interaction apparatus 120 also has a virtual character (hereinafter, also referred to as a “my character”) provided in advance. It is assumed that the interaction apparatus 120 selects either the “my character” or the distribution character and autonomously drives the selected character on the sub-screen.

The interaction apparatus 120 may control the action of the virtual character in a standalone manner; however, as illustrated in FIG. 4, it may also control the action of the virtual character by a cooperative operation of the interaction engine front end 410 and the interaction engine back end 420.

For example, when an event such as a climax scene occurs in a drama program which the user is viewing, the trigger distribution is performed from the service side, and in response to this, the trigger data is transferred from the display apparatus 110 to the interaction apparatus 120. Then, the interaction apparatus 120 can express such an action of the virtual character that speaks for the user's emotion or synchronizes with the user on the basis of the newly acquired scene description data.

Moreover, when an event such as scoring, conceding, or a fine play occurs while the user is viewing a sports program such as baseball or soccer, the trigger distribution is performed from the service side, and in response to this, the trigger data is transferred from the display apparatus 110 to the interaction apparatus 120. Then, on the basis of the newly acquired scene description data, the interaction apparatus 120 can express an action of the virtual character that follows the progress of the game, for example, rejoicing together or nodding at scoring and a fine play of the supported team, and saddening and being discouraged at conceding and an error of the supported team.

Moreover, while the user is viewing a news program, the trigger distribution is performed from the service side for each section such as domestic, international, social, economic, performing arts, sports, science, and region, and in response to this, the trigger data is transferred from the display apparatus 110 to the interaction apparatus 120. Then, the interaction apparatus 120 can express such an action of the virtual character that introduces a genre of interest to the user on the basis of the scene description data acquired sequentially.

Moreover, the trigger distribution is performed from the service side at the timing of a CM inserted in the main part of the broadcast or streaming program, and in response to this, the trigger data is transferred from the display apparatus 110 to the interaction apparatus 120. Then, the interaction apparatus 120 may perform the matching processing between the user's profile and the product, and change the way the virtual character recommends the product in response to a degree of interest of the user in the product, and the like.

Moreover, it is assumed that, no matter which type of event the virtual character is currently responding to with its action, the virtual character responds to questions and chats from the user.

FIG. 6 illustrates an example of the scene description data. Regarding an event such as a goal scene in soccer, this example includes a start time and an end time of the event, a type of the event (“soccer”), a content of the event (“goaled”, “team name”, “scoring status”), and the like. Note that there are various methods for expressing a scene from a video, and scene description data such as that illustrated in FIG. 6 can be automatically generated on the basis of an existing method.
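
Although the concrete format of the scene description data is not limited, data of the kind illustrated in FIG. 6 might be represented, for example, as follows; all field names and values are assumed here for illustration.

```python
# Hypothetical rendering of the FIG. 6 example: specific information regarding a scene.
scene_description = {
    "start_time": "00:42:10",                  # start time of the event in the content
    "end_time": "00:42:35",                    # end time of the event
    "event_type": "soccer",                    # type of the event
    "event": {
        "what": "goaled",                      # content of the event
        "team": "Team A",                      # team name
        "score": {"Team A": 2, "Team B": 1},   # scoring status
    },
}
```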

FIG. 7 illustrates another example of the scene description data. This example describes typical emotional expressions corresponding to the user's situation. Specifically, for the event type “soccer”, it describes that a happy or sad emotion is expressed when an event of “goaled” occurs.
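
The suggestion in FIG. 7 can likewise be pictured as a mapping from the event type and event content to an emotion that depends on the user's situation (here, which team the user supports); again, the structure below is an assumption for illustration.

```python
# Hypothetical rendering of the FIG. 7 example: typical emotional expressions per situation.
emotion_rules = {
    ("soccer", "goaled"): {
        "supported_team_scored": "happy",
        "other_team_scored": "sad",
    },
}

def suggest_emotion(scene: dict, supported_team: str) -> str:
    """Suggest an emotion for the virtual character from FIG. 6-style scene data."""
    rule = emotion_rules.get((scene["event_type"], scene["event"]["what"]), {})
    if scene["event"]["team"] == supported_team:
        return rule.get("supported_team_scored", "neutral")
    return rule.get("other_team_scored", "neutral")

# With the FIG. 6-style data sketched above, a supporter of Team A would get "happy".
```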

Note that the data describing specific information regarding the scene as illustrated in FIG. 6 and the data suggesting the emotional expression corresponding to the scene as illustrated in FIG. 7 may be used as individual scene description data files, and a combination of these two data files may be specified in the trigger data as the scene description data corresponding to the event. Alternatively, a single scene description data file including both a data portion as illustrated in FIG. 6 and a data portion as illustrated in FIG. 7 may be configured.

Moreover, a document structure or grammar for use in the scene description data is arbitrary, and FIGS. 6 and 7 illustrate merely examples.

Furthermore, when autonomously driving the virtual character, the interaction apparatus 120 may determine whether or not to use the scene description data provided every time an event occurs in the moving picture content that is being viewed, or may determine how much of the scene description data to use at that time. For example, the interaction apparatus 120 may determine whether or not to use the scene description data, or how much of it to use, in accordance with a user's instruction. Moreover, the scene description data may specify whether its description content is essential or arbitrary.
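
How much of the scene description data is actually used may thus depend on the user's instruction and on whether each description content is marked as essential or arbitrary; the following sketch assumes such markings and a simple user setting, neither of which is prescribed by the embodiment.

    # Illustrative policy only: always use portions marked "essential", and use
    # "arbitrary" portions only when the user permits scene-following behavior.
    # The "importance" field and the user setting are assumptions.
    def select_usable_portions(portions, user_allows_scene_following):
        usable = []
        for portion in portions:
            if portion.get("importance") == "essential":
                usable.append(portion)
            elif user_allows_scene_following:
                usable.append(portion)
        return usable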

FIG. 8 illustrates a basic operation example of the display apparatus 110 in the format of a flowchart.

When a main power supply is turned on and the display apparatus 110 starts, the display apparatus 110 checks whether or not the interaction apparatus 120 paired therewith is started (step S801).

Then, in a case where the interaction apparatus 120 is not started yet, the display apparatus 110 instructs the interaction apparatus 120 to start (step S802), sets the interaction apparatus 120 to a receiving state of being capable of receiving the trigger data and the like from the display apparatus 110 (step S803), and ends this process.

Note that, in the case of a system configuration of interacting with the user using not the interaction apparatus 120 but an MR device or a pet-type robot, a startup state of the MR device or the pet-type robot is checked in step S801, and the MR device or the pet-type robot is started and made into a receivable state in step S802. Moreover, in the case of a system configuration of displaying the virtual character on a child screen provided in the screen of the display apparatus 110, a startup state of an application for automatically driving the virtual character is checked, and in step S802, this application is made into a receivable state.
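
The startup check of FIG. 8 could be sketched as below, assuming hypothetical helper methods for pairing, startup, and state control; the method names are not part of the embodiment.

    # Illustrative sketch of FIG. 8 (steps S801 to S803); the method names are
    # assumptions standing in for device-specific pairing and control.
    def on_display_apparatus_started(interaction_apparatus):
        if not interaction_apparatus.is_started():        # step S801
            interaction_apparatus.start()                  # step S802
            interaction_apparatus.enter_receiving_state()  # step S803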

FIG. 9 illustrates a procedure of a processing operation implemented by the interaction apparatus 120 in a receiving state in the format of a flowchart.

The interaction apparatus 120 checks whether or not the trigger data is distributed from the display apparatus 110 (step S901).

When the trigger data is not distributed from the display apparatus 110 (No in step S901), the interaction apparatus 120 shifts to an autonomous conversation mode (step S902), and waits until the trigger data is distributed while causing the virtual character to autonomously interact with the user regardless of the content (scene) of the broadcast-type moving picture content displayed on the screen of the display apparatus 110.

On the other hand, when the trigger data is distributed from the display apparatus 110 (Yes in step S901), the interaction apparatus 120 further checks whether or not the data of the distribution character is also distributed from the display apparatus 110 (step S903).

In a case where the distribution character is not distributed from the display apparatus 110 (No in step S903), the interaction apparatus 120 selects the “my character” (step S904).

Moreover, in a case where the distribution character is distributed from the display apparatus 110 (Yes in step S903), the interaction apparatus 120 presents the user with a character selection screen including a list of selectable distribution characters (in other words, ones capable of being autonomously driven on the interaction apparatus 120) (step S905).

Then, when the user selects any of the distribution characters from the character list screen (Yes in step S906), the interaction apparatus 120 selects the distribution character selected by the user, displays it on the display unit 303, and drives it autonomously (step S907).

Moreover, in a case where the user does not select any of the distribution characters from the character list screen (No in step S906), the interaction apparatus 120 selects the “my character” (step S904), displays the “my character” on the display unit 303, and drives the same autonomously.

Then, upon selecting either the “my character” or the distribution character as described above, the interaction apparatus 120 shifts to a scene-following conversation mode (step S908). In this scene-following conversation mode, the interaction apparatus 120 interacts with the user using either the selected “my character” or distribution character on the basis of the scene description data acquired in accordance with the location or acquisition method specified in the trigger data.

Thereafter, the interaction apparatus 120 has a conversation with the user using the selected “my character” or distribution character until the next trigger data is distributed from the display apparatus 110.
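
The receiving-state processing of FIG. 9 could be sketched as below; the objects and method names are assumptions introduced only to show the branching among steps S901 to S908.

    # Illustrative sketch of FIG. 9; all names are assumptions.
    def on_receiving_state(interaction, trigger_data, distribution_characters):
        if trigger_data is None:                              # step S901: No
            interaction.enter_autonomous_conversation_mode()  # step S902
            return
        if not distribution_characters:                       # step S903: No
            character = interaction.my_character              # step S904
        else:
            chosen = interaction.show_character_selection(    # step S905
                distribution_characters)
            # steps S906/S907, or fall back to the "my character" (step S904)
            character = chosen if chosen is not None else interaction.my_character
        interaction.enter_scene_following_mode(character, trigger_data)  # step S908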

FIG. 10 illustrates a procedure of a processing operation to be implemented by the interaction apparatus 120 in the scene-following conversation mode in the format of a flowchart.

Upon receiving the trigger data from the display apparatus 110 (Yes in step S1001), the interaction apparatus 120 acquires the scene description data in accordance with the location or acquisition method specified in the trigger data (step S1002).

Subsequently, the interaction apparatus 120 checks whether or not it is possible to select any emotion data (step S1003).

When it is not possible to select the emotion data (No in step S1003), the interaction apparatus 120 drives the virtual character to perform an expression based on recommended emotion data specified in the scene description data (step S1004).

On the other hand, in a case where it is possible to select the emotion data (Yes in step S1003), the interaction apparatus 120 further checks whether or not there is user data that fits a category of the emotion data (step S1005).

Then, in a case where there is user data that fits the category of the emotion data (Yes in step S1005), the interaction apparatus 120 drives the virtual character to express an emotion according to the user data (step S1006).

Moreover, in a case where there is no user data that fits the category of the emotion data (No in step S1005), the interaction apparatus 120 drives the virtual character to perform a neutral emotional expression (step S1007).
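
The emotion selection of FIG. 10 could be sketched as below; the data fields, the user data lookup, and the method names are assumptions used only to show the branching among steps S1001 to S1007.

    # Illustrative sketch of FIG. 10; all names are assumptions.
    def on_scene_following_trigger(interaction, trigger_data, user_data):
        # steps S1001 and S1002: fetch the scene description data from the
        # location specified in the trigger data
        scene = interaction.fetch_scene_description(trigger_data.location)
        if not scene.get("selectable_emotions"):        # step S1003: No
            emotion = scene["recommended_emotion"]      # step S1004
        else:
            category = scene["emotion_category"]
            if category in user_data:                   # step S1005: Yes
                emotion = user_data[category]           # step S1006
            else:
                emotion = "neutral"                     # step S1007
        interaction.drive_virtual_character(emotion)    # express the selected emotion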

INDUSTRIAL APPLICABILITY

The detailed description has been given above of the technology disclosed in the present description with reference to the specific embodiment. However, it is self-evident that those skilled in the art can modify the embodiment or allow substitution therefor within the scope without departing from the spirit of the technology disclosed in the present disclosure.

The technology disclosed in the present disclosure can be suitably applied to a system for viewing the broadcast-type moving picture content. Although the embodiment applied to the MPEG-2 system has been mainly described in the present description, the scope of application of the technology disclosed in the present description is not limited thereto. For example, the technology disclosed in the present description can be similarly applied to other moving picture distribution systems such as MPEG-4 and MPEG-Dynamic Adaptive Streaming over HTTP (DASH).

In short, the technology disclosed in the present description has been described in the form of exemplification, and the stated contents of the present description should not be interpreted in a limited manner. In order to determine the spirit of the technology disclosed in the present description, the scope of claims should be taken into consideration.

Note that it is also possible for the technology disclosed in the present description to adopt such configurations as follows.

(1) An information processing apparatus including:

a receiving unit that receives a broadcast-type moving picture content; and

a notification unit that notifies a second device of data including a location or acquisition method of information suggesting an action of a virtual character, the action corresponding to an event that occurs in the moving picture content.

(2) The information processing apparatus according to the above (1), in which

the receiving unit receives the moving picture content to be broadcast or streamed.

(3) The information processing apparatus according to the above (1) or (2),

in which the notification unit notifies the second device of the data in response to a fact that the receiving unit has received a trigger of the event.

(4) The information processing apparatus according to the above (3), in which

the receiving unit receives the trigger of the event by StreamEvent or WebSocket.

(5) The information processing apparatus according to any one of the above (1) to (4), in which

the information processing apparatus is a television receiver provided with a display unit that displays the broadcast-type moving picture content.

(6) An information processing method including:

a receiving step of receiving a broadcast-type moving picture content;

a displaying step of displaying the moving picture content on a display unit; and

a notifying step of notifying, in response to an event that occurs in the moving picture content, a second device of data including a location or acquisition method of information suggesting an action of a virtual character.

(7) An information processing apparatus including:

a receiving unit that receives, from a first device, a notice of data including a location or acquisition method of information suggesting an action of a virtual character, the action corresponding to an event that occurs in a broadcast-type moving picture content; and

a control unit that acquires the information on the basis of the data issued as the notice from the first device, and controls driving of the virtual character on the basis of the acquired information.

(8) The information processing apparatus according to the above (7), in which

the receiving unit receives the notice from the first device as a television receiver that receives the broadcast-type moving picture content.

(9) The information processing apparatus according to the above (7) or (8), further including a display unit capable of two-dimensional or three-dimensional display, in which

the control unit controls the driving of the virtual character to be displayed using the display unit.

(10) The information processing apparatus according to the above (7) or (8), in which

the control unit displays the virtual character using an MR device.

(11) The information processing apparatus according to the above (7) or (8), in which

the control unit displays the virtual character on a child screen of the first device as a television receiver that receives the broadcast-type moving picture content.

(12) The information processing apparatus according to the above (7) or (8), in which

the control unit controls driving of a pet-type robot on the basis of the acquired information.

(13) The information processing apparatus according to any one of the above (7) to (12), in which

the control unit acquires the data including position information of the event in the moving picture content, a type of the event, and a content of the event.

(14) The information processing apparatus according to any one of the above (7) to (13), in which

the control unit acquires the data including information regarding an emotional expression of the virtual character for the event.

(15) The information processing apparatus according to any one of the above (7) to (14), in which

the receiving unit further receives data of the virtual character from the first device.

(16) An information processing method including:

a receiving step of receiving, from a first device, a notice of data including a location or acquisition method of information suggesting an action of a virtual character, the action corresponding to an event that occurs in a broadcast-type moving picture content;

an acquiring step of acquiring the information on the basis of the data issued as the notice from the first device; and

a controlling step of controlling driving of the virtual character on the basis of information acquired in the acquiring step.

(17) An information processing system including:

a first device that receives a broadcast-type moving picture content; and

a second device that controls driving of a virtual character, in which

in response to receiving a trigger of an event, the first device notifies the second device of data including a location or acquisition method of information suggesting an action of the virtual character, the action corresponding to the event that occurs in the moving picture content, and

the second device acquires the information on the basis of the data issued as a notice from the first device, and controls driving of the virtual character.

(18) A computer program written in a computer-readable format to cause a computer to function as:

a receiving unit that receives a broadcast-type moving picture content; and

a notification unit that notifies a second device of data including a location or acquisition method of information suggesting an action of a virtual character, the action corresponding to an event that occurs in the moving picture content.

(19) A computer program written in a computer-readable format to cause a computer to function as:

a receiving unit that receives, from a first device, a notice of data including a location or acquisition method of information suggesting an action of a virtual character, the action corresponding to an event that occurs in a broadcast-type moving picture content;

a control unit that acquires the information on the basis of the data issued as the notice from the first device; and

a control unit that controls driving of the virtual character on the basis of the acquired information.

REFERENCE SIGNS LIST

-   100 Interaction system
-   110 Display apparatus
-   120 Interaction apparatus
-   201 Control unit
-   202 Bus
-   203 Storage unit
-   204 Communication interface (IF) unit
-   205 Extension interface (IF) unit
-   206 Tuner/demodulator unit
-   207 Demultiplexer
-   208 Video decoder
-   209 Speech decoder
-   210 Superimposed character decoder
-   211 Subtitle decoder
-   212 Subtitle synthesis unit
-   213 Data decoder
-   214 Cache unit
-   215 Application (AP) control unit
-   216 Browser unit
-   217 Sound source unit
-   218 Video synthesis unit
-   219 Display unit
-   220 Speech synthesis unit
-   221 Speech output unit
-   222 Operation input unit
-   301 Processing unit
-   302 Communication unit
-   303 Display unit
-   304 Speech input unit
-   305 Speech output unit
-   306 Sensor unit
-   307 Recording unit
-   410 Interaction engine front end
-   420 Interaction engine back end
-   421 Speech recognition unit
-   422 Semantic analysis unit
-   423 Speech synthesis unit
-   424 Control unit
-   430 External agent service
-   431 Speech recognition unit
-   432 Semantic analysis unit
-   433 Processing unit
-   434 Speech synthesis unit
-   440 Speech agent device
-   441 Speech input unit
-   442 Speech output unit

1. An information processing apparatus comprising: a receiving unit that receives a broadcast-type moving picture content; and a notification unit that notifies a second device of data including a location or acquisition method of information suggesting an action of a virtual character, the action corresponding to an event that occurs in the moving picture content.
2. The information processing apparatus according to claim 1, wherein the receiving unit receives the moving picture content to be broadcast or streamed.
3. The information processing apparatus according to claim 1, wherein the notification unit notifies the second device of the data in response to a fact that the receiving unit has received a trigger of the event.
4. The information processing apparatus according to claim 3, wherein the receiving unit receives the trigger of the event by StreamEvent or WebSocket.
5. The information processing apparatus according to claim 1, wherein the information processing apparatus is a television receiver provided with a display unit that displays the broadcast-type moving picture content.
6. An information processing method comprising: a receiving step of receiving a broadcast-type moving picture content; a displaying step of displaying the moving picture content on a display unit; and a notifying step of notifying, in response to an event that occurs in the moving picture content, a second device of data including a location or acquisition method of information suggesting an action of a virtual character.
7. An information processing apparatus comprising: a receiving unit that receives, from a first device, a notice of data including a location or acquisition method of information suggesting an action of a virtual character, the action corresponding to an event that occurs in a broadcast-type moving picture content; and a control unit that acquires the information on a basis of the data issued as the notice from the first device, and controls driving of the virtual character on a basis of the acquired information.
8. The information processing apparatus according to claim 7, wherein the receiving unit receives the notice from the first device as a television receiver that receives the broadcast-type moving picture content.
9. The information processing apparatus according to claim 7, further comprising a display unit capable of two-dimensional or three-dimensional display, wherein the control unit controls the driving of the virtual character to be displayed using the display unit.
10. The information processing apparatus according to claim 7, wherein the control unit displays the virtual character using an MR device.
11. The information processing apparatus according to claim 7, wherein the control unit displays the virtual character on a child screen of the first device as a television receiver that receives the broadcast-type moving picture content.
12. The information processing apparatus according to claim 7, wherein the control unit controls driving of a pet-type robot on a basis of the acquired information.
13. The information processing apparatus according to claim 7, wherein the control unit acquires the data including position information of the event in the moving picture content, a type of the event, and a content of the event.
14. The information processing apparatus according to claim 7, wherein the control unit acquires the data including information regarding an emotional expression of the virtual character for the event.
15. The information processing apparatus according to claim 7, wherein the receiving unit further receives data of the virtual character from the first device.
16. An information processing method comprising: a receiving step of receiving, from a first device, a notice of data including a location or acquisition method of information suggesting an action of a virtual character, the action corresponding to an event that occurs in a broadcast-type moving picture content; an acquiring step of acquiring the information on a basis of the data issued as the notice from the first device; and a controlling step of controlling driving of the virtual character on a basis of information acquired in the acquiring step.
17. An information processing system comprising: a first device that receives a broadcast-type moving picture content; and a second device that controls driving of a virtual character, wherein in response to receiving a trigger of an event, the first device notifies the second device of data including a location or acquisition method of information suggesting an action of the virtual character, the action corresponding to the event that occurs in the moving picture content, and the second device acquires the information on a basis of the data issued as a notice from the first device, and controls driving of the virtual character.